A framework for modelling overdispersed count data, including the Poisson-shifted generalized inverse Gaussian distribution
نویسندگان
چکیده
A variety of methods of modelling overdispersed count data are compared. The methods are classified into threemain categories. The first category are ad hoc methods (i.e. pseudolikelihood, (extended) quasi-likelihood, double exponential family distributions). The second category are discretized continuous distributions and the third category are observational level random effects models (i.e. mixture models comprising explicit and non-explicit continuous mixture models and finite mixture models). The main focus of the paper is a family of mixed Poisson distributions defined so that its mean μ is an explicit parameter of the distribution. This allows easier interpretation when μ is modelled using explanatory variables and provides a more orthogonal parameterization to ease model fitting. Specific three parameter distributions considered are the Sichel and Delaporte distributions. A new four parameter distribution, the Poisson-shifted generalized inverse Gaussian distribution is introduced, which includes the Sichel and Delaporte distributions as a special and a limiting case respectively. A general formula for the derivative of the likelihood with respect to μ, applicable to the whole family of mixed Poisson distributions considered, is given. Within the framework introduced here all parameters of the distributions aremodelled as parametric and/or nonparametric (smooth) functions of explanatory variables. This provides a very flexible way of modelling count data.Maximum (penalized) likelihood estimation is used to fit the (non)parametricmodels. © 2008 Elsevier B.V. All rights reserved.
منابع مشابه
A simulation study comparing likelihood and non-likelihood approaches in analyzing overdispersed count data
Overdispersed count data are modelled with likelihood and non-likelihood approaches. Likelihood approaches include the Poisson mixtures with three distributions, the gamma, the lognormal, and the inverse Gaussian distributions. Non-likelihood approaches include the robust sandwich estimator and quasilikelihood. In this simulation study, overdispersed count data were simulated under the Poisson ...
متن کاملBayesian paradigm for analysing count data in longitudina studies using Poisson-generalized log-gamma model
In analyzing longitudinal data with counted responses, normal distribution is usually used for distribution of the random efffects. However, in some applications random effects may not be normally distributed. Misspecification of this distribution may cause reduction of efficiency of estimators. In this paper, a generalized log-gamma distribution is used for the random effects which includes th...
متن کاملQuantile regression for overdispersed count data: a hierarchical method
Generalized Poisson regression is commonly applied to overdispersed count data, and focused on modelling the conditional mean of the response. However, conditional mean regression models may be sensitive to response outliers and provide no information on other conditional distribution features of the response. We consider instead a hierarchical approach to quantile regression of overdispersed c...
متن کاملBayesian generalized additive models for location, scale and shape for zero-inflated and overdispersed count data
Frequent problems in applied research that prevent the application of the classical Poisson log-linear model for analyzing count data include overdispersion, an excess of zeros compared to the Poisson distribution, correlated responses, as well as complex predictor structures comprising nonlinear effects of continuous covariates, interactions or spatial effects. We propose a general class of Ba...
متن کاملParameter Estimation in Spatial Generalized Linear Mixed Models with Skew Gaussian Random Effects using Laplace Approximation
Spatial generalized linear mixed models are used commonly for modelling non-Gaussian discrete spatial responses. We present an algorithm for parameter estimation of the models using Laplace approximation of likelihood function. In these models, the spatial correlation structure of data is carried out by random effects or latent variables. In most spatial analysis, it is assumed that rando...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Computational Statistics & Data Analysis
دوره 53 شماره
صفحات -
تاریخ انتشار 2008